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METHOD AND APPARATUS FOR ACCESSING TARGETED, 
PERSONALIZED VOICE/AUDIO WEB CONTENT THROUGH 

WIRELESS DEVICES 

FIELD OF THE INVENTION 
5 The present invention relates to the audio content systems. More specifically, 

one embodiment of the invention provides access to personalized audio content through a 

device with browsing capabilities. 

BACKGROUND OF THE INVENTION 
Initially, cellular telephones were client devices that only connected to a voice 

10 network. Thus, a user of a cellular telephone ("cell phone", for short) could use the device to 
coimect to a voice network, such as the public switched telephone network ("PSTN**), to 
make a circuit connection between the cell phone and another telephone or telephone device. 
Once the connection was made the two devices would communicate using voice band signals. 
Where the two devices were in use by people having conversations, the voice band signals 

15 would simply be the electrical representation of the conversants speaking. In other cases, one 
of the devices was a machine, such as a computing device, a voice mail system, and 
automated response unit ("ARU", also often referred to as a "touch-tone (TM) response 
system"), or voice response unit ("VRU") responsive to voice inputs. 

Recently, with the increasing use of the Intemet, and more specifically the 

20 hyperlinked collection of documents (often referred to as "pages") known as *the World 

Wide Web" ("WWW" or "the Web", for short), newer cell phones often include a Web client 
(often called a "browser") that provides the user of the cell phone access to the Web. In the 
case of wireless access to the Web, of which a cell phone is but one example, there may be 
some limits on the data network usage, such as a limited display and a limited bandwidth 

25 allocated to each device. Thus, in the common usage of the terms, a cell phone user might be 
said to be using the cell phone to browse Web pages on the wireless web. 

As used herein, **wireless web" refers to the Web generally, but with the 
understanding that some modifications might be made to some of the docimients in the 
collection so that the documents are more easily handled in the wireless envirorunent. 

30 Examples of modifications include limitations on the class or number of documents that are 
accessible via the wireless device, modifications to reduce the nvmiber of bits required the 
represent the document, or modifications to make the docimient more readable on a smaller 
display typical of a wireless access device. One simplified approach to wireless web pages is 



to take existing Web pages containing high-bandwidth components such as image and audio 
content and omitting all but the text content of the pages. However, it should be understood 
that such modifications are not a requirement of a set of documents available over a wireless 
web. 

5 The Internet is a packet-switched network, whereas the PSTN is a circuit 

switched network. Nonetheless, newer cell phones can provide both types of connections, as 
the communications channel between the cell phone and the stationary cell transceiver that is 
the cell phone's point of contact is a digital channel. Thus, voice is digitized in the cell phone 
and is transmitted along with any data packet traffic along the channel between the cell phone 

10 and the transceiver. How the data and voice are separated at the transceiver, or further mto 
the cellular telephone service network, is well known and need not be further described 
herein. From the perspective of the cell phone, there is a voice network into which a voice 
circuit can be established and a data network that will accept data packets and retum data 
packets directed at the cell phone. Unless otherwise indicated, it should be understood that 

1 5 references herein to cell phones might also apply to any other device that provides wireless 
access to a voice network and to a data network. 

Typically, the voice connection and the data connection in a wireless access 
device are separate, such that both connections cannot be used at once, either because of 
bandwidth constraints or because a wireless service provider might choose, for billing or 

20 other reasons, not to allow simultaneous use of both connections. However, many devices 
have a capability to switch a user from a data connection to a voice connection more or less 
seamlessly. For example, with some cell phones, the user can use a wireless browser to 
navigate among wireless web pages. 

Navigation from page to page is typically done using links, where one page 

25 contains links to other pages and the user "moves" from the one page to another page by 

selecting (e.g., clicking on) on the one page a link to the other page. Some pages may have 
links not to other pages, but to a voice channel. Instead of the link containing a uniform 
reference locator (**URL") referencing another page, the link contains a reference to a 
telephone number. An example of where this might be useful is where the browser aspect of 

30 the cell phone is used to find a telephone number and then dial that telephone number. If a 
set of wireless web pages were set up to present restaurant choices, for example, the pages 
might include a starting page that leads to a tree of choices that a user could navigate down to 
reach a leaf page of the tree where the leaf page is for a specific restaurant. If that leaf page 
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contained a link that included a telephone number, the user could just select the link and be 
connected to the selected restaurant. 

Since the wireless device does not typically keep both the data connection and 
the voice connection open at the same time, the wireless device would typically have storage 
5 that would store the telephone number from the link. To effect this, once the user selects a 
link that refers to a desired voice connection (i.e., a telephone number of the desired called 
party), the wireless device stores the telephone number, closes the data connection, and then 
sets up a voice connection using the stored telephone number. From the user's perspective, 
the user would use the wireless device to navigate a menu of pages and select a party to call, 

10 and then be connected, via a voice connection, to the called party referenced in the link. At 
that point, the voice connection would be active and the data connection would not be, 
however it should be imderstood that the system could operate similarly in a wireless network 
that allowed simultaneous use of the data and voice connections. 

Some cellular telephone service providers allow for customization, whereby 

15 the user can program their wireless device to speed dial telephone numbers and additional 
digits that are interpreted by a device that answers the call, such as the user's voice mail 
system. Some cellular telephone service providers also allow for customization of the 
wireless web pages accessible to the user. For example, a user might set up pages containing 
links to commonly used telephone nimibers of that user. 

20 SUMMARY OF THE INVENTION 

In a wireless web system according to one embodiment of the present 

invention, users navigate web pages that include links to audio content where the pages are 

provided over a data connection and the audio content is provided over a voice connection. 

In a specific implementation, an audio content reference generator generates a reference to a 

25 portion of static audio content and that audio content reference is provided to the user's 

wireless web client in the form of a link on a wireless web page, or other page retrieved by 
the wireless web device over the data connection. The audio content reference and a 
telephone number of an audio server form the link on the wireless web page, so that when a 
user selects that link, the wireless device establishes a voice connection to the audio server 

30 using the telephone number and then provides the audio server with the audio content 
reference so that the user hears the specifically referenced audio content over the voice 
channel. 
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The audio content reference can be provided to the audio server in many 
different ways. For example, the audio content reference might be provided to the telephone 
network as an extension and the audio server might have caller-ID capability that would 
receive an apparent extension niunber from the telephone company, with the audio server 
5 knowing that a received caller-ID extension should be interpreted as an audio content 

reference. In another approach, the calling wireless device might dial the telephone number, 
wait for the voice circuit connection to be fully established and then transmit the audio 
content reference as a series of tones to an audio server that is programmed to listen for tones 
when a connection is established and the interpret those tones as an audio content reference. 
10 In a third approach, the audio content reference may be stored on the server side and be 
associated with a device identifier when the user browses to the page that initiates the 
telephony call. In this situation, the telephony system uses the device identifier and the 
stored server side information to directly associate the audio content reference with the 
incoming call. 

15 The actual audio content provided by the audio content server in response to a 

reference could be of several forms. One form is static content, such as a recording of movie 
times, or connecting the user directly to voice menus deep within a voice menu structure 
without the user having to manually navigate the voice menu structure. Another form is 
dynamic content, such as an audio listing of the user's new voice mail messages. Yet another 

20 form is interactive content, where the particular audio played for the user depends on further 
inputs provided by the user. Still another form of content is Jive content. An example of live 
content might be where the audio content reference is interpreted by the audio server as a 
request to connect the caller with an internal voice connection, as might be used where the 
user navigates a menu of help pages and selects a link representing a particular technical 

25 problem, resulting in the user's wireless device dialing a telephone number for a help desk 

audio server that then interprets the provided audio content reference as a request to route the 
call to a system or person that can handle the help request. 

A further understanding of the nature and the advantages of the inventions 
disclosed herein may be realized by reference of the remaining portions of the specification 

30 and the attached drawings. 

BRIEF DESCRIPTION OF THE DRAWINGS 
Fig. 1 is a block diagram of a wireless web system according to one 

embodiment of the present invention. 
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Fig. 2 is a flowchart of a process used to provide audio content to a device 
user according to one embodiment of the present invention. 



DESCRIPTION OF THE SPECIFIC EMBODIMENTS 
The wireless web system described herein allows a user to navigate a data 

5 network and use a link to audio content provided over the data network to access audio 

content over a voice network. The audio content can be static content, dynamic content, 

interactive content, live content or another form of audio content. The audio content might 

include audio content alerts (voice mail, email, events, calendar alerts), information in audio 

form, music/radio on demand, a routed telephone call, etc. 

10 Using the wireless web system, a user can access audio content without 

necessarily navigating an audio menu. For example, in order for a user to access a certain 

voice mail, a user may have to dial into the voice mail, enter a password, and pass through 

various voice prompts to receive the specified voice mail. However, according to one 

embodiment of this invention, a user may select a link to that specified voice mail and listen 

15 to the voice mail without taking any further steps. 

Additionally, some audio content can be dynamic and vary depending on the 

state of a system. For example, a user may want to access the state in a voicemail system of 

new messages. A user may select a link to the new messages state and be provided with the 

new messages in the system without any fiirther steps. In addition, a user may be provided 

20 with a prompt that precedes the audio content state. For example, the audio content link may 

bring the user directly to the new messages voice prompt. The user would then be able to 

choose between the options given by the voice prompt. 

Fig. 1 shows a system 10 used to access personal audio/voice content 

according to one embodiment of the invention. In system 10, device 12 is designed to receive 

25 digital content over a data connection fi-om audio reference generator 14 and connect to 

telephony platform 24 over a voice network. 

Device 12 can be any device that has audio/voice capabilities and can browse 

text and/or graphics. In a specific embodiment, device 12 can be enabled to receive digital 

content over a data channel and receive audio/voice content over a voice channel. Some 

30 examples of devices include mobile phones, personal computers, hand-held computers, 

personal digital assistants, etc. 

Audio reference generator 14 can be any device capable of generating a menu 

of items 18 containing audio content links 16. Each audio content link 16 is preferably 
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associated with a telephone number and audio content reference 20. However, in a specific 
embodiment, link 16 can contain just an audio content reference 20. For example, the menu 
of items 18 can be associated with one telephone number and each link 16 can contain just an 
audio content reference. Additionally, a group of menu items can be associated with one 
5 telephone number and another group of menu items associated with another telephone 

number. It is also understood that the number of groups is unlimited. In all cases, if a user 
selects a link in the menu of items 1 8, the telephone number associated with the group of 
menu items 18 is used with the audio content reference 20 associated with the specific link 
16. In this way, a link will only have to contain an audio content reference 20. Additionally, 

10 in a specific embodiment, a device can be programmed to dial a telephony system and 
transmit the audio content reference. 

Menu of items 18 can include a list of selectable links 16 that are navigable by 
the user of the device. Additionally, the menu can be multi-tiered in that a user could select a 
link 16 and be presented with a new list of links related to the selected link. For example, in 

15 a specific embodiment, a user could have both a saved messages and new messages section. 
When the saved messages link is selected, all the saved messages would appear as links on 
the user's device. Furthermore, the user's new messages link would provide links to all the 
new messages when selected. Therefore, in one embodiment of a menu of items 1 8, a user 
can be sent a dbcimient that includes one or more links and can also be multi-tiered. In a 

20 specific embodiment, the menu of items can include HDML ("Hand Held Markup 

Language"), HTML (HyperText Markup Language), or Javascript documents. However, any 
document capable of providing selectable links can be used. 

Once the audio content reference 20 is generated and associated with the audio 
content link, the reference can then be stored in a database with the associated audio content 

25 referenced by the link. In a specific embodiment, an audio content reference can reference a 
specific audio content and be stored in a database 22. In this case, each entry of database 22 
would contain an audio content reference and the associated specific audio content. For 
example, an entry could include an audio content reference that references a specific 
voicemail. 

30 Additionally, in an altemative specific embodiment, an audio content 

reference can reference a state of the system and be stored in state table database 24. In this 
case, an entry would include an audio content reference and the associated audio content in 
the state of the system. Additionally, an entry could include an audio content reference and a 
referencing system pointing to the audio content associated with the state of the system. For 
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example, an audio content reference could represent all the new messages in a state of the 
system. Accordingly, the entry in the database could include all the new messages or include 
references to all the new messages. Preferably, if references to the new messages were 
included, the references would be the specific audio content references for each new 
5 message. Therefore, a central database can be maintained containing all possible audio 
content. The state table can then reference and retrieve the referenced messages fi-om the 
central database. Also, users can customize the state table to include any audio content firom 
the database of audio content references. The user can include selected audio content an 
entry in the state table and only the audio content references need to be stored. It is also 

10 recognized that the state table database 24 and database 22 can be combined into one 
database or be maintained in multiple databases. 

Device 12 can be connected through a voice network to telephony system 24. 
Preferably, a connection through a PSTN (Public Switched Telephone Network) or VOIP 
(Voice Over IP) network is made. In a specific embodiment, telephony system 24 can be the 

15 Yahoo! Inc. Voice Access Telephony Platform. However, any system that can receive an 
audio content reference sent through the voice network can be used. 

In one embodiment, audio content reference 20 can be transmitted to 
telephony system 24 as an extension with the telephone number. In this case, if caller-ID 
were available, the extension would be included in the caller identification. For example, if 

20 the telephone number is "1 8006992466" and audio content reference is "12345", caller 

identification would read in the number "1 800699246612345". Therefore, telephony system 
24 can then read the telephone number and extension fi-om the caller identification. 

In another embodiment, audio content reference 20 can be read in by doing a 
digit collect. In this embodiment, audio content reference 20 would be collected by doing a 

25 digit collect at the time the call is connected. In an alternative embodiment, device 12 can 
connect to telephony system 24 and wait for a signal indicating that audio content reference 
20 should be transmitted. Once system 24 sends the signal, audio content reference 20 can be 
sent and digit collected. Additionally, in another alternative embodiment, system 24 could lie 
progranmied to listen for tones after a connection is made. The audio content reference 20 

30 can then be sent as tones to the system 24. Thus, a system can collect the reference without 
revealing the reference in a caller identification scheme. One potential problem with caller 
identification is an extension would be collected when the phone number is dialed because 
the extension is added to the end of the telephone number. Consequently, a sensitive or 
confidential extension could be collected by a caller identification device. However, 
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sensitive or confidential references can be protected by not including the reference at the end 
of the telephone number. 

The telephone number can also be used to identify the user if caller 
identification is available. The telephone number can be matched to a user database to 
5 determine a userlD of the user. In this case, the system can be customized to a user's 

preferences. Another possible way a caller can be identified is when a user has signed into 
the Yahoo! Mobile service by entering a previously setup YahooID and password. Once 
signed in, the caller can be identified by matching the caller identification with a user 
database to determine the YahooID or userlD of the user. Now, personal information of the 

1 0 user can be presented using the YahooID. Other ways to identify a caller include matching 
the unique ID of the device with a user database to identify a user. Additionally, any other 
relevant information, such as the location of the caller, can also be used to present specific 
audio content to the user. For example, the location of a caller could be used to generate a 
requested weather report fi'om the caller's location 

15 The audio content reference 20 may also be stored on the server side and be 

associated with a device identifier when the user browses to the page that initiates the 
telephony call. Examples of device identifiers are the telephone number of the device, the 
userlD or YahooID, uniquelD of the device, etc. In this situation, the system 24 uses the 
device identifier and the stored server side information to directly associate the audio content 

20 reference with the incoming call. 

Audio server 26 can be any server configured to retrieve content fi'om a 
database. In a specific embodiment, audio server 26 can receive audio content reference 20, 
retrieve the audio content associated with that reference, and return the audio content to 
telephony system 24. Because the desired audio content has been stored in a database with 

25 the audio content reference transmitted by device 12, audio server 26 can retrieve the audio 
content with just the audio content reference. The audio server does not have to 
communicate with any other components of the system other than the databases containing 
the audio content. 

Referring now to Fig. 2, a flow chart shows one series of steps, labeled SI, S2, 
30 etc., in the order most likely to be carried out for the method of retrieving personalized, 
targeted audio content. In stqp SI, device 12 can be provided digital content over a data 
connection to the user. In this step, audio reference generator 14 can produce a document 
containing a link or a plurality of links for that user to click on or select. 
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In step S2, the user selects a link corresponding to the desired audio content. 
The selection of the link makes a call to the telephony system 24 and connects device 12 to 
the telephony system (S3). If the audio content reference was sent as an extension on the end 
of the telephone number (S4), caller identification can collect the telephone number and 
5 extension (S5). The process would then proceed to step (S9). However, if the audio content 
reference was not sent as an extension, device 12 could be waiting for a signal from the 
telephony system to transmit the audio content reference (S6). In this case, device 12 would 
wait for the signal and transmit the audio content reference when the signal is received (S7). 
If the audio content reference were transmitted when device 12 was connected to telephony 

10 system 24, the reference would be read in by a digit collect (S8). The appropriate audio 
content is then retrieved (S9) and presented back to the user (SIO). 

The above description is illustrative and not restrictive. Many variations of 
the invention will become apparent to those who are skilled in the art upon review of this 
disclosure. For example, system 10 can be a wire line system where device 12 is connected 

15 to telephony system 24 and audio reference generator 14 through a modem/DSL connection. 
The only requirement is that device 12 be capable of connecting through a data/voice line. 
The scope of the invention should, therefore, be determined not with reference to the above 
description, but instead should be determined with reference to the appended claims along 
with their fiill scope of equivalence. 
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